Applying Grid Technologies to XML Based OLAP Cube Construction
Abstract
On-Line Analytical Processing (OLAP) is a powerful method for analysing large amounts of warehouse data. Typically, the data for an OLAP database is collected from a set of data repositories such as operational databases. This data set is often huge, and it may not be known in advance which data are required, or when, to perform the desired analysis tasks. Some parts of the data may be needed only occasionally. Therefore, storing all the data in the OLAP database and keeping that database constantly up to date is not only highly demanding but may also be overkill in practice. This suggests that in some applications it would be more feasible to form the OLAP cubes only when they are actually needed. However, OLAP cube construction can be a slow process. Here, we present a system that applies Grid technologies to distribute the computation needed in the cube construction process. As the data sources may well be heterogeneous, we propose an XML language as an interim format for collecting the data. The user’s definition of a new OLAP cube often includes selecting and aggregating the data. In our system this computation is distributed to the computers that store the original data. This reduces network traffic and speeds up the computation, which is now performed in parallel. We have implemented a prototype of the system. The implementation uses two software packages, Spitfire (a database front end) and Mobile Analyzer (a Java distributed computing platform), both of which have their background in Grid technologies.
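The idea of pushing selection and aggregation to the machines that hold the data, and exchanging only small XML summaries, can be sketched as follows. This is a minimal illustration and not the Spitfire or Mobile Analyzer API: the in-memory `SOURCES` table, the `partialCube`/`cell` element names, and every function name are assumptions made for the example, and a thread pool stands in for the remote Grid nodes.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor
import xml.etree.ElementTree as ET

# Hypothetical stand-ins for two remote data repositories.
SOURCES = {
    "site_a": [
        {"region": "north", "year": 2003, "sales": 10.0},
        {"region": "south", "year": 2003, "sales": 5.0},
        {"region": "north", "year": 2002, "sales": 7.0},
    ],
    "site_b": [
        {"region": "north", "year": 2003, "sales": 2.5},
        {"region": "south", "year": 2003, "sales": 4.0},
    ],
}


def aggregate_at_source(rows, dims, measure, predicate):
    """Runs where the data lives: select rows, sum the measure per
    dimension combination, and return the result as XML text
    (an assumed interim format, not the paper's actual schema)."""
    totals = defaultdict(float)
    for row in rows:
        if predicate(row):
            key = tuple(str(row[d]) for d in dims)
            totals[key] += row[measure]
    root = ET.Element("partialCube")
    for key, value in sorted(totals.items()):
        cell = ET.SubElement(root, "cell", dict(zip(dims, key)))
        cell.text = repr(value)
    return ET.tostring(root, encoding="unicode")


def merge_partials(xml_docs, dims):
    """Runs at the coordinator: combine the partial sums into one cube."""
    cube = defaultdict(float)
    for doc in xml_docs:
        for cell in ET.fromstring(doc).iter("cell"):
            key = tuple(cell.get(d) for d in dims)
            cube[key] += float(cell.text)
    return dict(cube)


def build_cube(dims, measure, predicate):
    # The thread pool simulates the sources working in parallel.
    with ThreadPoolExecutor() as pool:
        futures = [
            pool.submit(aggregate_at_source, rows, dims, measure, predicate)
            for rows in SOURCES.values()
        ]
        return merge_partials([f.result() for f in futures], dims)


cube = build_cube(("region", "year"), "sales", lambda r: r["year"] == 2003)
```

Only the aggregated cells cross the network in this sketch, which is the source of the reduced traffic described above. Summation merges cleanly because it is distributive; an average would need each source to ship both a sum and a count.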
Similar resources
XML encoding and Web Services for Spatial OLAP data cube exchange: an SOA approach
XML and Web Services technologies have revolutionized the way data are exchanged on the Internet. Meanwhile, Spatial OLAP (SOLAP) tools have emerged to bridge the gap between the Business Intelligence and Geographic Information Systems domains. While Web Services specifications such as XML for Analysis enable the use of OLAP tools in Service Oriented Architecture (SOA) environments, no solution...
GMLA: A XML Schema for Integration and Exchange of Multidimensional-Geographical Data
The integration among DW, OLAP and GIS has been given considerable attention in recent years by many researchers and industrial corporations. This may be a result of: 1) DW/OLAP can improve GIS spatial queries whereas, 2) a GIS can provide better support to deal with the DW/OLAP geographic data. Some research about this integration has already been done. However, these approaches do not deal wi...
A tool for data cube construction from structurally heterogeneous XML documents
Data cubes for OLAP (Online Analytical Processing) often need to be constructed from data located in several distributed and autonomous information sources. Such a data integration process is challenging due to semantic, syntactic, and structural heterogeneity among the data. While XML (Extensible Markup Language) is the de facto standard for data exchange, the three types of heterogeneity rema...
An interoperable XML encoding for the exchange of Spatial OLAP data cubes in SOA environments
XML and Web Services technologies have revolutionized the way data are exchanged on the Internet. Meanwhile, Spatial OLAP (SOLAP) tools have emerged to bridge the gap between the Business Intelligence and Geographic Information Systems domains. While Web Services specifications such as XML for Analysis enable the use of OLAP tools in Service Oriented Architecture (SOA) environments, no solution...
XML-OLAP: A Multidimensional Analysis Framework for XML Warehouses
Recently, a large number of XML documents are available on the Internet. This trend motivated many researchers to analyze them multi-dimensionally in the same way as relational data. In this paper, we propose a new framework for multidimensional analysis of XML documents, which we call XML-OLAP. We base XML-OLAP on XML warehouses where every fact data as well as dimension data are stored as XML...